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Abstract 

With the proliferation of sequence data, great challenges are posed in the correct annotation of endogenous 
retroviruses, which together comprise up to ten per cent of the genomes of many organisms. It is therefore 
essential that all sources of information are carefully considered before drawing conclusions concerning the 
phylogeny, distribution and biological properties of endogenous retroviruses. We suggest that such due diligence 
has not been applied in the description of an endogenous ecotropic retrovirus that recently appeared in 
Retrovirology. 



Commentary 

We are writing in regard to the paper by Lee et al. [1], 
recently published in Retrovirology. We have a number 
of concerns about the accuracy of this paper that need 
to be considered. 

First we would like to ask whether the virus identified 
is novel or of intact coding potential. In the pioneering 
study by Jenkins et al. [2], they examined the endogen- 
ous ecotropic proviruses [Emv loci) present in the germ 
line of inbred mice. Different inherited proviruses were 
defined by Southern hybridization analysis using an eco- 
tropic murine leukemia virus-specific envelope gene 
probe (pEco) and DNA from different mouse strains cut 
with several different restriction enzymes to yield a vari- 
ety of virus-flanking sequence fragments. Different pro- 
viruses, integrated at different chromosomal locations, 
will yield characteristic sets of fragments [2], Emv2 was 
found in C57BL/6J and related strains. It was character- 
ized by pEco reactive fragments of 5.2, 7.0, 11.5, 17.0 kb 
in DNA digested with Pvull, Hindlll, Xbal and Bell 
respectively [2,3]. A number of genetic studies show 
that Emv2 maps to the distal region of mouse chromo- 
some 8 http://www.informatics.jax.org/searches/mapda- 
ta_report_by_marker.cgi?8452, and whilst not yet 
annotated on the C57BL/6J reference genome assembly, 
its location can be ascertained by BLASTn searches of 
the genomic sequence with the pEco probe sequence. 
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This reveals only a single match in this region of chro- 
mosome 8 for the entire C57BL/6J genome. Although a 
fraction of old C57BL/6 mice express ecotropic virus [4] 
and a poorly inducible ecotropic virus has been mapped 
to the distal region of chromosome 8 of both C57BL6 
[5] and C57BL/10 [6] mice, transfection studies clearly 
indicate that the Emv2 provirus is replication defective 
due to a specific base change in pol that results in an 
Ala to Pro change in RT [3,7,8]. Presumably correction 
of this defect is readily and routinely accomplished, per- 
haps by recombination with another endogenous virus 
[9]. 

In the paper by Lee et al. [1], an ecotropic provirus 
also mapping to chromosome 8 was found by mining 
the v37 build of the C67BL/6J genome sequence. They 
refer to this provirus as novel, calling it 'ERV mch8 ', and 
pointing to the absence of relevant annotation and 
sequence information in the NCBI databases to establish 
its relationship to Emv2. However, we note that the 
sequence predicts that 'ERV mc h8' would give rise to 
pEco reactive fragments of 4.9, 7.1. 11.2 and 15.1 kb fol- 
lowing digestion with Pvull, Hindlll, Xbal and Bell, it 
carries the G to C change resulting in the inactivating 
pol mutation and maps to the same region of mouse 
chromosome 8 that carries the only ecotropic murine 
leukemia provirus present in B6 mice. Thus, there 
seems no reason to invoke a novel ecotropic provirus. 
Further, to label this virus as possessing "intact coding 
potential" also seems misleading given its defective 
nature. 
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We would further like to point out the need for great 
care in classifying murine leukemia viruses for phyloge- 
netic purposes. In Figure one of the paper, a tree is shown 
with "polytropic" (U13766) and "xenotropic" (DQ399707) 
MLVs. Indeed, although these sequences are likely to 
encode viruses that are polytropic and xenotropic in host 
range, neither is endogenous. Instead, inspection of the 
NCBI annotations reveals that both are recombinants 
(U13766 is a class I MCF virus and thus likely to contain 
endogenous ecotropic, polytropic and xenotropic 
sequences see [10] and DQ399707 is XMRV clone VP62, 
most probably a polytropic/xenotropic recombinant [11]). 
Better choices for such an analysis would be found among 
the endogenous non-ecotropic ERV sequences listed in 
[12]. This problem is also seen in Figure two. It shows a 
comparison between four gag sequences, two of which are 
endogenous (Emvl and Emv2), and two that should not be 
thought of as endogenous but are replication competent 
viruses most probably derived by recombination between 
Emvl (Nerv) or Emvl (MelARV) and one or more non- 
ecotropic viruses [5,9]. Recombination has converted these 
viruses to B-tropism, allowing replication in A/J or B6 
derived cells or tissues, and may also be responsible for 
repairing the pol mutation. 

Jenkins et al. defined the strain distribution of differ- 
ent endogenous ecotropic proviruses, including Emv2, 
by Southern hybridization [2]. They found Emv2 only in 
C57BL/6J and a few closely related strains (C57BL/KsJ, 
C57BL/6ByJ, C57BL/10 J, C57BL/10SnJ, C57BR/cdJ). By 
contrast, Lee et al., using a PCR based approach and 
primers that are wrongly presumed specific for the 
'ERV mch8 ' sequence, found an ecotropic provirus on 
chromosome 8 in a wide variety of strains. Unfortu- 
nately, the primers selected both lie within the provirus 
itself, and sequence comparisons predict that they are 
unable to distinguish between Emv2 and at least two 
other Emvs {Emvl and Emvll). Unambiguous identifica- 
tion of specific proviruses by PCR is most easily accom- 
plished using one primer designed to the provirus and 
one corresponding to sequences flanking its integration 
site. Use of two internal viral primers is likely to give a 
positive signal with mouse DNAs containing any Emv 
proviruses. This fact makes the data presented in Figure 
three of Lee et al. regarding the distribution of endogen- 
ous ecotropic proviruses as well as any discussion based 
on them wholly unreliable. For example, their data indi- 
cate a shared ecotropic provirus in C3H/HeJ and 
C57BL/6J mice; this is inconsistent with the molecular 
demonstration of two independently segregating pro- 
viruses in BXH recombinant inbred mice [13]. Parenthe- 
tically, we also note that the use of the same primers in 
expression studies provides no control against the pre- 
sence of DNA contamination or amplification of unre- 
lated transcripts; it would have been highly desirable to 



use primers spanning the viral intron or starting mate- 
rial from a strain containing another Emv as controls. 

We consider that these shortcomings should have 
been obvious from a consideration of the appropriate 
literature, and we caution against the blind acceptance 
of genome annotations. One might have hoped these 
weaknesses would have been identified before 
publication. 
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